Set Up the Azure Blob Connector

The Azure Blob Data Connector allows you to ingest documents from your Azure Blob Storage into your project.

Prerequisites

To use the Azure Blob connector, you will need the credentials for one of the following authentication methods:
  • For Storage Access Key authentication:
    • An Azure Storage Account connection string.
    • The name of the container within your Azure Storage Account from which you want to ingest data.
  • For Shared Access Signature (SAS) authentication:
    • A Shared Access Signature (SAS) URL for your Azure Blob container.

In either case, the container must already exist and contain the files you want to ingest. You can find these details in the Azure Portal under your storage account.
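
Before entering Storage Access Key credentials, it can help to sanity-check them locally. The following is a minimal sketch, not part of the connector: the function names are hypothetical, and it assumes the usual `Key=Value;Key=Value` connection string layout with `AccountName` and `AccountKey` fields, plus Azure's container naming rules (3 to 63 characters; lowercase letters, digits, and hyphens; no consecutive hyphens).

```python
import re

# Azure container naming rules: 3-63 chars, lowercase letters, digits, hyphens,
# starting and ending with a letter or digit, with no consecutive hyphens.
CONTAINER_NAME_RE = re.compile(r"^(?!.*--)[a-z0-9][a-z0-9-]{1,61}[a-z0-9]$")


def parse_connection_string(conn_str: str) -> dict:
    """Split 'Key=Value;Key=Value' pairs into a dict (hypothetical helper)."""
    fields = {}
    for part in conn_str.strip().split(";"):
        if not part:
            continue
        # Split on the first '=' only: base64 account keys often end in '='.
        key, sep, value = part.partition("=")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def check_access_key_inputs(conn_str: str, container: str) -> list:
    """Return a list of problems; an empty list means the inputs look plausible."""
    problems = []
    fields = parse_connection_string(conn_str)
    for required in ("AccountName", "AccountKey"):
        if not fields.get(required):
            problems.append(f"connection string is missing {required}")
    if not CONTAINER_NAME_RE.match(container):
        problems.append("container name does not follow Azure naming rules")
    return problems
```

An empty result only means the values are well formed; it does not confirm that the key is valid or that the container exists.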

Steps

  1. Select Azure Blob Data Source: Navigate to the Data Sources section of your project. Click Add data source and select Azure Blob from the available library.
  2. Provide Connection Details and Choose Authentication Method: Choose one of the two authentication options below and enter the corresponding details.

    Option 1: Storage Access Key Authentication

    This method requires your storage account’s connection string and the container name.
    1. Select Authentication Method: Choose the Storage Access Key option when setting up the connector.
    2. Locate Credentials:
      • In the Azure Portal, navigate to your storage account.
      • Go to Access keys under the Security + networking section.
      • Copy the Connection string shown for one of the keys (key1 or key2).
    3. Enter Details:
      • Connection String: Paste the saved connection string.
      • Container Name: Enter the exact name of the container you wish to ingest.
    Ensure you provide the correct container name to synchronize the intended content.

    Option 2: Shared Access Signature (SAS) Authentication

    This method uses a generated Shared Access Signature (SAS) URL for granular access.
    1. Select Authentication Method: Choose the Shared Access Signature (SAS) option when setting up the connector.
    2. Generate SAS URL:
      • In the Azure Portal, open the storage account that holds your data.
      • Navigate to Containers and select the specific container you want to sync.
      • Click Shared access tokens (or Generate SAS).
      • Configure the SAS token with an appropriate expiration time and permissions (ensure read and list permissions are granted for ingestion).
      • Click Generate SAS token and URL.
    3. Enter Details:
      • Connection URL: Paste the generated Shared Access Signature (SAS) URL.
  3. Monitor Ingestion Status
    Once you provide your connection details, the page refreshes to display the ingestion status. To check the current status later, click the data source again.
    Ingestion can take some time depending on the volume of data.
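
If ingestion with a SAS URL (Option 2) does not start, a common cause is a token that has expired or lacks read and list permissions. The sketch below is a hypothetical local check, not part of the connector. It assumes the standard SAS query parameters: `sp` for permissions, `se` for the expiry time in ISO 8601 UTC, and `sig` for the signature.

```python
from datetime import datetime, timezone
from typing import Optional
from urllib.parse import parse_qs, urlparse


def check_sas_url(sas_url: str, now: Optional[datetime] = None) -> list:
    """Return a list of problems; an empty list means the URL looks usable."""
    problems = []
    now = now or datetime.now(timezone.utc)
    parsed = urlparse(sas_url)
    params = parse_qs(parsed.query)

    # A container SAS URL carries the container name in its path.
    if not parsed.path.strip("/"):
        problems.append("URL does not include a container path")
    if "sig" not in params:
        problems.append("URL has no 'sig' (signature) parameter")

    # 'sp' lists permissions as single letters, e.g. 'rl' for read + list.
    permissions = params.get("sp", [""])[0]
    for flag, label in (("r", "read"), ("l", "list")):
        if flag not in permissions:
            problems.append(f"SAS is missing {label} permission")

    # 'se' is the expiry time; a trailing 'Z' means UTC.
    expiry_raw = params.get("se", [""])[0]
    try:
        expiry = datetime.fromisoformat(expiry_raw.replace("Z", "+00:00"))
        if expiry.tzinfo is None:
            expiry = expiry.replace(tzinfo=timezone.utc)
        if expiry <= now:
            problems.append("SAS token has expired")
    except ValueError:
        problems.append("SAS expiry ('se') is missing or unreadable")
    return problems
```

The check reads only the URL itself and makes no network calls, so it cannot tell whether the signature is valid or the token has been revoked.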

Next Steps

After your data has been successfully ingested, the Azure Blob data source is ready to be used with an Agent.